Evolution of Wikipedia's Category Structure

نویسندگان

  • Krzysztof Suchecki
  • Alkim Almila Akdag Salah
  • Cheng Gao
  • Andrea Scharnhorst
چکیده

The e-Humanities Group, Royal Netherlands Academy of Arts and Sciences (KNAW) Joan Muyskenweg 25, 1096 CJ Amsterdam, The Netherlands and Erasmus Virtual Knowledge Studio, Erasmus University Rotterdam Burgemeester Oudlaan 50, 3062 PA Rotterdam, The Netherlands currently: IFISC, Instituto de F́ısica Interdisciplinar y Sistemas Complejos (CSIC-UIB), Campus Universitat Illes Balears, E-07122 Palma de Mallorca, Spain [email protected]

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Exploring Wikipedia's Category Graph for Query Classification

Wikipedia’s category graph is a network of 400,000 interconnected category labels, and can be a powerful resource for many classification tasks. However, its size and the lack of order can make it difficult to navigate. In this paper, we present a new algorithm to efficiently explore this graph and discover accurate classification labels. We implement our algorithm as the core of a query classi...

متن کامل

Using Wikipedia's Category Structure for Entity Search

In this paper we investigate how the category structure of Wikipedia can be exploited for Entity Ranking. In the last decade, the Web has not only grown in size, but also changed its character, due to collaborative content creation and an increasing amount of structure. Current Search Engines find Web pages rather than information or knowledge, and leave it to the searchers to locate the sought...

متن کامل

Extending a multilingual Lexical Resource by bootstrapping Named Entity Classification using Wikipedia's Category System

Named Entity Recognition and Classification (NERC) is a well-studied NLP task which is typically approached using machine learning algorithms that rely on training data whose creation usually is expensive. The high costs result in the lack of NERC training data for many languages. An approach to create a multilingual NE corpus was presented in Wentland et al. (2008). The resulting resource call...

متن کامل

Wikipedia as an Ontology for Describing Documents

Identifying topics and concepts associated with a set of documents is a task common to many applications. It can help in the annotation and categorization of documents and be used to model a person's current interests for improving search results, business intelligence or selecting appropriate advertisements. One approach is to associate a document with a set of topics selected from a fixed ont...

متن کامل

Texture Evolution in Low Carbon Steel Fabricated by Multi-directional Forging of the Martensite Starting Structuree

It has been clarified that deformation and annealing of martensite starting structure can produce ultrafine-grained structure in low carbon steel.  This study aims to investigate the texture evolution and mechanical properties of samples with martensite structure deformed by two different forging processes. The martensitic steel samples were forged by plane strain compression and multi-directio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1203.0788  شماره 

صفحات  -

تاریخ انتشار 2012